Exploring Instruction-Fetch Bandwidth Requirement in Wide-Issue Superscalar Processors

Abstract

The effective performance of wide-issue superscalar processors depends on many parameters, such as branch prediction accuracy, available instruction-level parallelism, and instruction-fetch bandwidth. This paper explores the relations between some of these parameters, and in particular the instruction-fetch bandwidth requirement. We introduce new enhancements that effectively boost the instruction-fetch bandwidth of conventional fetch engines. However, experiments strongly show that a given gain in instruction-fetch bandwidth improves performance less as the base bandwidth increases. At the level of bandwidth exhibited by the proposed schemes, the performance improvement is small. This brings to light potential relations between the bandwidth and the other parameters. We provide a model to explain this behavior and to quantify some of these relations. Based on the experimental observation that the available parallelism in an instruction window grows as the square root of the window size, we derive from the model that the instruction-fetch bandwidth requirement increases as the square root of the distance between mispredicted branches. We also show that the instruction-fetch bandwidth requirement increases linearly with the parallelism available in a fixed-size instruction window. Finally, we review some existing techniques for enhancing performance and describe their impact on the instruction-fetch requirement in light of the above relations. These techniques include those that increase the amount of instruction-level parallelism (e.g., value prediction) and those that enlarge the effective instruction window (e.g., eager execution).
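The two scaling relations stated in the abstract can be illustrated with a minimal sketch. It is not the paper's model: the function names, the constant `alpha`, and the choice to combine both relations into a single product `alpha * sqrt(distance)` are assumptions made here for illustration. The results are relative, unitless values meant only for comparing one configuration against another.

```python
import math


def available_parallelism(window_size, alpha=1.0):
    """Parallelism in a window, assumed to grow as sqrt(window_size).

    `alpha` is a hypothetical workload-dependent constant: the
    parallelism available per unit of sqrt(window size).
    """
    return alpha * math.sqrt(window_size)


def relative_fetch_requirement(mispredict_distance, alpha=1.0):
    """Relative fetch-bandwidth requirement (unitless, for comparison only).

    Assumed form: proportional to sqrt(distance between mispredicted
    branches) and linear in `alpha`, matching the two trends in the
    abstract. The product form itself is an assumption of this sketch.
    """
    return alpha * math.sqrt(mispredict_distance)


if __name__ == "__main__":
    base = relative_fetch_requirement(100)
    # Quadrupling the distance between mispredictions (better prediction)
    better_prediction = relative_fetch_requirement(400)
    # Doubling the parallelism available in a fixed-size window
    more_parallelism = relative_fetch_requirement(100, alpha=2.0)
    print(better_prediction / base)
    print(more_parallelism / base)
```

Under these assumptions, quadrupling the distance between mispredicted branches only doubles the relative requirement, whereas doubling the parallelism available in a fixed-size window doubles it directly.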
